Very low bit rate wireless video telephony using face coding

نویسندگان

  • Mikael Persson
  • Johan Strömbeck
چکیده

In this thesis a novel video telephony compression scheme is proposed, implemented and discussed. The scheme generates a talking head sequence from a head and shoulder video telephony sequence. The generated talking head mimics the facial expressions of the individual depicted in the head and shoulder input sequence. The scheme is based on model based coding and more specifically based on an eigenspace approach. The model which is used to represent the objects to be encoded is statistically derived as the principal components of a training sequence depicting the individual performing a wide range of facial expressions. The thesis introduces the concept of eigenfeatures as used in video compression and a method for encoding the facial expressions of the talking head as a number of coefficients defining a linear combination of the eigenfeatures. Using the proposed scheme acceptable video telephony can be achieved at data rates as low as 3-4 kBit/s. Acknowledgements First of all, my thanks go to my supervisors, Henrik Storm, for inspiration, ideas and support and Fredrik Kahl for ideas and support especially regarding the mathematically intensive parts. Also thanks to Johan Strömbeck and Karl-Anders Johansson for their endless stream of mostly good ideas. Especially thanks to Summus, Inc. (USA), Raleigh, North Carolina, for funding this project. Finally, thanks to all my colleagues at TAT Movide AB where most of my work has been performed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Very Low Bit-Rate Video Coding by Combining H.264/AVC Standard and 2-D Discrete Wavelet Transform

In this paper, we propose a new method for very low bit-rate video coding that combines H.264/AVC standard and two-dimensional discrete wavelet transform. In this method, first a two dimensional wavelet transform is applied on each video frame independently to extract the low frequency components for each frame and then the low frequency parts of all frames are coded using H.264/AVC codec. On t...

متن کامل

A robust, scalable, object-based video compression technique for very low bit-rate coding

| This paper describes an object-based video coding scheme that was proposed as part of the Texas Instruments' proposal to the emerging ISO MPEG-4 video compression standard. This technique achieves eecient compression by separating coherently moving objects from stationary background and compactly representing their shape, motion and the content. In addition to providing improved coding eecien...

متن کامل

Burst-by-burst Adaptive Joint-Detection CDMA/H.26L Based Wireless Video Telephony using TTCM and LDPC Codes

A low bit-rate video coding techniques using the H.26L standard codec for robust transmission in mobile multimedia environments are presented. For the sake of achieving error resilience, the source codec has to make provisions for error detection, resynchronization and error concealment. Thus a packetization technique invoking adaptive bit-rate control was used in conjuction with the various mo...

متن کامل

A Full-Fuzzy Rate Controller for Variable Bit Rate Video

In this paper, we propose a new full-fuzzy video ratecontrol algorithm (RCA) for variable bit rate (VBR) videoapplications. The proposed RCA provides high qualitycompressed video with a low degree computational complexity.By controlling the quantization parameter (QP) on a picturebasis, it produces VBR video bit streams. The proposed RCAhas been implemented on the JM H.264/AVC video codec andth...

متن کامل

Highly scalable wavelet-based video codec for very low bit-rate environment

In this paper, we introduce a highly scalable video compression system for very low bit-rate videoconferencing and telephony applications around 10–30 Kbits/s. The video codec first performs a motion-compensated three-dimensional (3-D) wavelet (packet) decomposition of a group of video frames, and then encodes the important wavelet coefficients using a new data structure called tri-zerotrees (T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003